696 research outputs found

    On the Use of Perceptual Properties for Melody Estimation

    Get PDF
    cote interne IRCAM: Liao11aInternational audienceThis paper is about the use of perceptual principles for melody estimation. The melody stream is understood as generated by the most dominant source. Since the source with the strongest energy may not be perceptually the most dominant one, it is proposed to study the perceptual properties for melody estimation: loudness, masking effect and timbre similarity. The related criteria are integrated into a melody estimation system and their respective contributions are evaluated. The effectiveness of these perceptual criteria is confirmed by the evaluation results using more than one hundred excerpts of music recordings

    Automatic Piano Transcription with Hierarchical Frequency-Time Transformer

    Full text link
    Taking long-term spectral and temporal dependencies into account is essential for automatic piano transcription. This is especially helpful when determining the precise onset and offset for each note in the polyphonic piano content. In this case, we may rely on the capability of self-attention mechanism in Transformers to capture these long-term dependencies in the frequency and time axes. In this work, we propose hFT-Transformer, which is an automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture. The first hierarchy includes a convolutional block in the time axis, a Transformer encoder in the frequency axis, and a Transformer decoder that converts the dimension in the frequency axis. The output is then fed into the second hierarchy which consists of another Transformer encoder in the time axis. We evaluated our method with the widely used MAPS and MAESTRO v3.0.0 datasets, and it demonstrated state-of-the-art performance on all the F1-scores of the metrics among Frame, Note, Note with Offset, and Note with Offset and Velocity estimations.Comment: 8 pages, 6 figures, to be published in ISMIR202

    Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects

    Full text link
    We propose an end-to-end music mixing style transfer system that converts the mixing style of an input multitrack to that of a reference song. This is achieved with an encoder pre-trained with a contrastive objective to extract only audio effects related information from a reference music recording. All our models are trained in a self-supervised manner from an already-processed wet multitrack dataset with an effective data preprocessing method that alleviates the data scarcity of obtaining unprocessed dry data. We analyze the proposed encoder for the disentanglement capability of audio effects and also validate its performance for mixing style transfer through both objective and subjective evaluations. From the results, we show the proposed system not only converts the mixing style of multitrack audio close to a reference but is also robust with mixture-wise style transfer upon using a music source separation model

    VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance

    Full text link
    Restoring degraded music signals is essential to enhance audio quality for downstream music manipulation. Recent diffusion-based music restoration methods have demonstrated impressive performance, and among them, diffusion posterior sampling (DPS) stands out given its intrinsic properties, making it versatile across various restoration tasks. In this paper, we identify that there are potential issues which will degrade current DPS-based methods' performance and introduce the way to mitigate the issues inspired by diverse diffusion guidance techniques including the RePaint (RP) strategy and the Pseudoinverse-Guided Diffusion Models (Π\PiGDM). We demonstrate our methods for the vocal declipping and bandwidth extension tasks under various levels of distortion and cutoff frequency, respectively. In both tasks, our methods outperform the current DPS-based music restoration benchmarks. We refer to \url{http://carlosholivan.github.io/demos/audio-restoration-2023.html} for examples of the restored audio samples

    Bacteremic pneumonia caused by Nocardia veterana in an HIV-infected patient

    Get PDF
    SummaryDisseminated Nocardia veterana infection has rarely been reported. We describe the first reported case of N. veterana bacteremic pneumonia in an HIV-infected patient. The isolate was confirmed by 16S rRNA sequencing analysis. The patient initially responded well to trimethoprim–sulfamethoxazole treatment (minimum inhibitory concentration 0.25μg/ml), but died of ventilator-associated pneumonia

    Kinematic Analyses of a Parallel-type Independently Controllable Transmission

    Get PDF
    This study proposes a novel design of a parallel-type Independently Controllable Transmission (ICT). The parallel-type ICT can produce a continuously variable transmission ratio and a required angular output velocity that can be independently manipulated by a controller yet not affected by the angular velocity of the input shaft. The proposed parallel-type ICT is composed of two planetary gear trains and two transmission-connecting members. A prototype was built to investigate its kinematic characteristics and verify application feasibility

    Ample Pairs

    Full text link
    We show that the ample degree of a stable theory with trivial forking is preserved when we consider the corresponding theory of belles paires, if it exists. This result also applies to the theory of HH-structures of a trivial theory of rank 11.Comment: Research partially supported by the program MTM2014-59178-P. The second author conducted research with support of the programme ANR-13-BS01-0006 Valcomo. The third author would like to thank the European Research Council grant 33882
    • …
    corecore